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Abstract 

Generalized quantum measurements with N distinct outcomes are used for determining 
the density matrix, of order d, of an ensemble of quantum systems. The resulting proba- 
bilities are represented by a point in an A-dimensional space. It is shown that this point 
lies in a convex domain having at most d 2 — 1 dimensions. 
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In elementary quantum measurement theory, a test performed on a quantum system is 
represented by a complete set of orthogonal projection operators P m , where the label m 
takes at most d different values (d is the dimensionality of the Hilbert space, assumed 
finite). The probability of obtaining outcome m of that test, following the preparation of 
a quantum ensemble in a state p, is p m = tr (pP m ) . If p is arbitrary, the only constraint 
on these probabilities is J2 m Pm — 1- 

It is well known that this type of test is not optimal if only a finite number of quantum 
systems can be observed. (As a concrete example, we receive five photons from a distant 
source, and we want a good estimate of their polarization. What is the best strategy?) 
In such a case, more information may be derived from a positive operator valued measure 
(POVM) [1, 2] with N > d different outcomes. Such a POVM is a set of N positive 
matrices A M , which in general do not commute, but still satisfy X^A^ = 1, where 1 is 
the unit matrix in d dimensions, and p is an arbitrary label running from 1 to N. If the 
quantum system is prepared in state p, the probability to get outcome p is 



To each preparation p of the system, we thus associate iV probabilities, pi, p%,. ■ ■ ,Pn- 
We refer to this set of positive numbers as a point P(p) in probability space. The set of all 
output points is labelled by P. Density matrices form a convex set whose extreme points 
are pure states [3]. The linear relation flU) between input states and output probabilities 
implies that the set of points P is also convex: 



Thus the shape of the hypersurface that bounds the domain of the points P(p), for all 
possible preparations of the system, is determined by the outputs for the pure states only. 

Obviously X^£V* = 1, so that the points P(p) lie on a hyperplane of dimension (N — 1). 
However, the results of generalized measurements are subject to stronger constraints 
(which may be important for the statistical analysis of experimental results). Let D 
be the number of linearly independent parameters in p (for a generic density matrix in a 
(i-dimensional complex Hilbert space, D = d 2 — 1). The following proposition will now 
be proved: If N > (D + 1), the output of any POVM is confined to a D- dimensional 
subspace. 



Pi*(p) = p( a ^\p) = tr(pA M ). 



(1) 



P(p) EE P(x Pl + (1 - X)p 2 ) = XP( P1 ) + (1 - X)P( P2 ). 



(2) 
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Indeed, let us write the elements of a generic density matrix in terms of real (symmetric) 
and imaginary (antisymmetric) parts, 



Pmn ^mn + irjmn- (3) 

There are d(d — l)/2 independent elements r\ mn and (d + 2)(d — l)/2 independent £ mn , 
because of the condition tr p = 1 which can be written 

d-i 

Zdd = l-J2 6m- (4) 

n=l 

Likewise, the elements of each POVM matrix A M , of order d, can be written as x mn + iy mn 
in terms of d 2 real parameters. We thus obtain from Eq. ([]]), 

d-l d 

P/j.{p) = { X mm ~~ X dd) 6mm + 2 ^ ^ (^mn^mn + UmnVmn) + ^rfrf- (5) 

m=l m=l n>m 

Thus -P(p) is obtained from p by an affine transformation [4] 

p = Mr + c, (6) 

where p is a 'vector' consisting of any N — 1 components (the remaining component is 
obtained from Y^^Pfj, — !)• Likewise r is a vector of D linearly independent parameters of 
p. The matrix M, with N — 1 rows and D columns, depends only on the POVM used for 
the test; and c is a vector whose N — 1 components are x^ d , which also are parameters of 
the POVM. Explicitly, the p-th row of M, which is 

( x n ~ x dd) ■ ■ ■ ( x d-i,d-i ~ x dd) 2xi2 . . . 2x^_ l d 2yi2 . . . 2y^_ l d , (7) 
and 

r T = {in ■■■ £d-i,d-i £12 ••• £d-i,d V12 ■■■ Vd-i,d), (8) 

have D = d 2 — 1 real components. 

If N — 1 > D, the rank of M is at most D, and any D + l vectors p = Mr are linearly 
dependent. A translation by the constant vector c in the (N — l)-dimensional vector 
space transforms a D-dimensional subspace into another Z)-dimensional subspace. Thus 
the output of any POVM on a system whose density matrix has D linearly independent 
parameters is confined to a D-dimensional subspace of the probability space. 
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Next, let us examine the shape of the surface that encloses the domain of P(p). The 
set of density operators, and therefore the set of probabilities, are convex. The extreme 
points of these sets are the pure states, which are defined by 2(d — 1) real parameters, 
and the probabilities corresponding to these pure states, respectively. Thus any interior 
point of the D-dimensional set P is a convex combination of the extreme points of that 
set, which lie on a 2(d — 1) -dimensional hypercurve. 

Note that any density matrix p of rank d can be written as a convex combination of no 
more than d pure density matrices, corresponding to the eigenvectors of p. As a result, 
any interior point of P can be obtained from at most d extreme points. This result ought 
to be compared with Caratheodory's theorem [4], which states that any interior point 
of an arbitrary convex set of dimension D can be obtained as a convex combination of 
D + 1 (or fewer) extreme points of that set. Here, D + 1 = d 2 . The smaller number of 
extreme points needed in the present case is due to the fact that density matrices are not 
an arbitrary convex set (they are positive and have unit trace). 

As a simple example, consider the case of spin-| systems. Their states can be described 
by means of a Bloch sphere. The pure states correspond to points on the surface of the 
sphere, and mixed states lie in its interior. With our notations, we have 

/ 

/ii / i ■"> i / lli ■"> \ 

(9) 



Xll X12 + t]Jl2 



where the three parameters are subject to the positivity condition 

xu (1 - in) - x\ 2 - y\ 2 > (10) 

The transformation (^) is linear. Therefore the Bloch sphere is transformed into another 
quadratic surface, usually an ellipsoid. Exceptionally, if a POVM element has unit norm 
(so that the corresponding can be equal to 1, and then all the other vanish), we 
have a cone. 

In particular, consider a POVM with four elements, = (l + a At -cr)/4, where the four 
unit vectors a M form a regular tetrahedron in a real 3-dimensional Euclidean space, and cr 
denotes the three Pauli matrices. Likewise, any state p can be written as p = (l + n-cr)/2. 
We thus have 

Vil = tr (pA„) = (1 + a /t • n)/4, (11) 
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whence 
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E(^-i) 2 = n2 /i2. (12) 

H=l 



The Bloch sphere is thus mapped into a 3-dimensional sphere of radius l/y/12, centered 
at p^ — | , and lying in the hyperplane J2^P^ — 1- If we want to parametrize that hyper- 
plane with three of the p M , we substitute in the above equation p 4 = 1 — p 1 — p 2 —pz- We 
then obtain an ellipsoid in a 3-dimensional space, as shown in Fig. 1. It is also possible 
to use as coordinates suitable linear combinations of the p^, orthogonal to J2^P^, such as 

x = pi + p 2 - p 3 - Pa, 

y = pi -P2 + P3 -p<t, ( 13 ) 

Z = Pi ~P2 -P3+P4- 

The Bloch sphere is then mapped into a sphere x 2 + y 2 + z 2 < |. 

The case of spin-1 systems is more complicated. A generic density matrix can be 
written in terms of its eigenstates as 

3 

p = E a >j>N- ( 14 ) 

This is as a convex combination of three extreme points. Any pure state, such as the 
above eigenstates, can be parametrized, with a suitable choice of its phase, as 

\v) = (sin#cos0e ia ,sin#sin0e l/3 ,cos#), (15) 

where 

< 9, <f)< tt/2, and < a, (3 < 2tt. (16) 

All the components of the corresponding pure p, which is a matrix of rank 1, are functions 
of the four parameters 6, <p, a, and f3. Thus all the probabilities p^ = tr (pA M ) are also 
functions of these four angles. This gives the extreme points of the set P: they form a 
four-parameter hypersurface in an eight-dimensional space S (which is itself embedded 
in the A^-dimensional space of the p^). The rest of the boundary of S, corresponding to 
density matrices of rank two, lies on the segments between any pair of extreme points. 
The interior points of S can be obtained by a convex combination of three suitably chosen 
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extreme points, as in Eq. (0). All these considerations are readily extended to quantum 
systems whose Hilbert spaces have more than three dimensions: there are (d — 1) polar 
angles like 9 and 0, and (d — 1) phases like a and (3. 

Finally, let us consider potential applications of the above results to the analysis of 
experimental data. The probabilities p M cannot be measured exactly, as this would require 
testing an infinite number of quantum sytems. If only n systems are available, and the 
/i-th outcome is found to occur times (so that the experimenter records a set of iV 
integers or zeros), then the N ratios = n^/n are the only data available for evaluating 
the true p^. Obviously, X^ 1 ?^ — 1 5 just like X^p^, but the other constraints on p^ may 
not be satisfied. In particular, if iV > D + 1, the point Q = {q^} will not in general lie in 
the hyperplane of dimension D to which the point P is restricted. 

How far can Q be from the true PI Each one of the experimental data n M has an 
expected binomial distribution with dispersion 

Arv = [np^l - p^)] 1 ' 2 ~ [n^n - n^/n} 1 ' 2 , (17) 

where the last expression is valid if ^> 1. We can imagine an error box with sides 
equal to Aq^, centered at the point Q, and we then have to examine where that error box 
overlaps with the hyperplane to which P is constrained. 

Obviously, it is best to design the experiment so as to have N = D + 1 = d 2 , and not 
more than that. A larger value of N leads to a less efficient use of the experimental data. 
This result is reminiscent of Davies's theorem [5] which deals with a related question, 
namely how to maximize the mutual information obtainable from a set of non-orthogonal 
signals. The theorem asserts that no more than d 2 outputs are needed. 

If N — D + 1, the only question is whether Q lies in the convex domain of P, namely 
whether the resulting p, obtained by solving Eq. ([J), is a positive matrix. If it is, then 
Q is the best estimate for the true P. In the opposite case, we may consider whether 
the error bars Ag M reach the convex domain of P. If they do, we still get a reasonable 
estimate. If even this fails, the experimenter has to test a larger number of samples in 
order to have a meaningful measurement. 
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FIG. 1. The probability ellipsoid that corresponds to the Bloch sphere is tangent to the 
plane p\ + p 2 + P3 = 1 (that is p^ = 0) at the point pi = p 2 = P3 = | and likewise it 
is tangent to each coordinate plane at the point where the two coordinates in that plane 
are |. 
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